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Abstract — The problem of hypothesis testing against indepen- 
dence for a Gauss-Markov random field (GMRF) is analyzed. 
Assuming an acyclic dependency graph, an expression for the 
log-likelihood ratio of detection is derived. Assuming random 
placement of nodes over a large region according to the Poisson 
or uniform distribution and nearest-neighbor dependency graph, 
the error exponent of the Neyman-Pearson detector is derived 
using large-deviations theory. The error exponent is expressed as 
a dependency-graph functional and the limit is evaluated through 
a special law of large numbers for stabilizing graph functionals. 
The exponent is analyzed for different values of the variance 
ratio and correlation. It is found that a more correlated GMRF 
has a higher exponent at low values of the variance ratio whereas 
the situation is reversed at high values of the variance ratio. 



(b) Hq : Independent observations. 



(a) 7-L 1 : Gauss-Markov random field 
with nearest-neighbor dependency. 

Fig. 1. Illustration of the hypothesis-testing problem. 
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Index Terms — Detection and Estimation, Error exponent, Gauss- 
Markov random fields, Law of large numbers. 

I. Introduction 

FOr distributed detection, the so-called conditionally IID 
assumption is mathematically convenient and is widely 
assumed in the literature. The assumption states that condi- 
tioned on a particular hypothesis, the observations at sen- 
sors are independent and identically distributed. In practice, 
however, spatially distributed sensors often observe correlated 
data, since natural spatial signals have stochastic dependence. 
Examples of correlated signals include measurements from 
temperature and humidity sensors, or from magnetometric 
sensors tracking a moving vehicle. Audio data is also rich 
in spatial correlations, due to the presence of echoes. 

Spatial random signals are typically acausal in contrast 
to temporal signals. In the literature, the two are usually 
distinguished by referring to acausal signals as random fields 
(RF) and causal signals as random processes (RP). Random 
fields are of interest in a variety of engineering areas and 
may represent natural phenomena such as the dispersion of 
atmospheric pollutants, groundwater flow, rainfall distribution 
or the mesoscale circulation of ocean fields ||2) . 

In this paper, we consider the problem of hypothesis testing 
for independence, shown in FigJT] Specifically, under the 
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alternative hypothesis, sensors collect samples from a Gauss- 
Markov random field (GMRF), whereas the samples are in- 
dependent under the null hypothesis. We model the GMRF 
through a graphical approach, in which a dependency graph 
(DG) specifies the stochastic dependence between different 
sensor observations. This dependency graph can have different 
degrees of sparsity and can even be fully connected. However 
typically, spatial interactions are based on proximity, where 
the edges are included according to some specified rule based 
on the local point configuration (3], El- With a regular lattice 
structure (e.g., in image processing, Ising model), a fixed set 
of neighbors can be specified in a straight-forward manner 
|5|. However, the situation is more complicated for arbitrarily 
placed nodes. In this paper, we consider the nearest-neighbor 
graph (NNG), which is the simplest proximity graph. The 
nearest-neighbor relation has been used in several areas of 
applied science, including the social sciences, geography and 
ecology, where proximity data is often important (6|, Q. 

We consider the Neyman-Pearson (NP) formulation, where 
the detector is optimal at a fixed false-alarm probability. But, 
under this formulation, analysis of performance metrics such 
as error probability is intractable for an arbitrary number of 
observations. Hence, we focus on the large-network scenario, 
where the number of observations goes to infinity. For any pos- 
itive fixed level of false alarm or the type-I error probability, 
when the mis-detection or the type-II error probability Pm (n) 
of the NP detector decays exponentially with the sample size 
n, we have the error exponent defined by 



D:= — lim — \ogPnj(n) 



(1) 



The error exponent is an important performance measure since 
a large exponent implies faster decay of error probability with 
increasing sample size. 
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Additionally, we assume that the sensors observing the sig- 
nal field are placed i.i.d. according to the uniform or Poisson 
distribution. Since nodes are placed irregularly, it results in a 
non-stationary GMRF (for the definition of stationary GMRF, 
see (SI P. 57]). We assume that the number of nodes goes 
to infinity, by way of the coverage area of the nodes going 
to infinity, while keeping the node density fixed. Under this 
formulation, we derive the detection error exponent, assuming 
access to all the observations. 

A. Related Work and Contributions 

The kind of hypothesis testing we consider is called testing 
for independence. In |9), iflOl . problems of this kind are 
considered with rate constraints on the channels and for two 
sources, using a large number of samples at each source. In this 
paper, we assume that there are no constraints on the channel 
and that the observations have the correlation structure of the 
GMRF. Our formulation is different since there is a single 
observation at every sensor, and the number of sensors goes 
to infinity. 

GMRF is also known as conditional auto-regression (CAR) 
in the seminal work of Besag ifTTI . fl2l . They have a wide ar- 
ray of applications in fields such as speech recognition, natural 
language processing, coding, geo-statistics, image analysis and 
AI. The literature is too vast to mention here. For an exposition 
on GMRF, see ffl, lfl3l. 

Another related problem is the detection of Gauss-Markov 
random processes (GMRP) in Gaussian noise, which is a 
classical problem [14]. There is an extensive literature on 
the large-deviations approach to the analysis of detection 
of GMRP [15|-|25|, but closed-form expressions have been 
derived only for some special cases, e.g., Il26l - ll28l . GMRP 
has been characterized via inversion algorithms for block- 
banded matrices |29ll , 11301 . However, these approaches are not 
amenable to the extension of the problem to planar and higher 
dimensional spaces, since they deal with random processes 
rather than random fields, or to the random placement of 
nodes. 

Related to the GMRF, there is an alternative and more 
restrictive approach, known as the spatial auto-regressive 
model (SAR) and has been extensively studied in the field 
of spatial data-mining. In [7], this formulation is considered 
with (directed) nearest-neighbor interaction and a closed-form 
ML estimator of the AR spatial parameter is characterized. We 
do not consider this formulation in this paper. 

To our knowledge, large-deviation analysis of the detection 
of acausal non-stationary GMRF has not been treated before. 
We first express the likelihood function of a GMRF with an 
arbitrary acyclic dependency graph, in terms of its covariance 
matrix. The joint distribution can also be derived by expressing 
it in terms of the marginal probability of the nodes and the 
joint probability at the edges of the dependency graph OD . 

E2. 

We consider the detection problem represented in FigQ] 
under the additional assumptions of nearest-neighbor depen- 
dency. We consider the location of the sensors as a random 
point set drawn from uniform or Poisson distribution and 



defined on expanding regions. This framework allows us to 
exploit recent advances in computational geometry (331 , (34]. 
By casting the error exponent as a limit of the sum of graph 
functionals, we are able to apply the special law of large 
numbers (LLN) for functionals on graphs derived in l33ll . 
We obtain the final form of the exponent by exploiting some 
special properties of the NNG We then numerically evaluate 
the exponent for different values of the variance ratio and 
correlation, for exponential and constant correlation functions. 
We conclude that at a fixed node density, a more correlated 
GMRF has a higher exponent at low values of variance ratio, 
whereas the opposite is true at high values of variance ratio. 

B. Notation and Organization 

Vectors and matrices are written in boldface. Random 
variables are in capital letters, random processes and random 
fields in boldface capitals and sets in calligraphic font. For 
the matrix A = A(i,j) denotes the element in the 

i lh row and j ,h column and |A| its determinant. For sets A and 
B, let A\B = {i : i G A,i £ B} and let | • | denote cardinality. 

An undirected graph Q is a tuple Q = (V,£) where V = 
{1, 2, . . . , ri } is the vertexQ set and £ — i,j € V, i ^ 

j} is the edge set. When i and j have an edge between them, 
i and j are neighbors denoted by i ~ j (otherwise it is i j). 
For a directed graph, we denote the edges by £ = {< i,j > 
, i,j € V,i 7^ j}, where the direction of the edge is from i 
to j. The neighborhood function of a node i is the set of all 
other nodes having an edge with it, i.e., 

M(i) = {jeV:j^i,(i,j)e£}. (2) 

The number of neighbors of a node i is called its degree, 
denoted by Deg(i). A node with a single edge i.e., its degree 
is 1 is known as a leaf and the corresponding edge as a leaf 
edge, otherwise it is known as an internal or interior edge. Let 
dist(z, j) be the Euclidean distance between any two nodes. 
Let Rij denote the (random) Euclidean edge-length of 
in graph Q = (V,£), 

i^=dist(i,j), V{i,j)e£. (3) 

Our paper is organized as follows. We provide a description 
of the GMRF in section [Til focusing on the acyclic dependency 
graph in section [111] and providing an expression for the 
likelihood function in section UlI-AI We define the hypothesis- 
testing problem in section [TV] and specify additional assump- 
tion on the covariance matrix of the GMRF in section ITV-AI 
In section ITV-Bl we assume additionally that the dependency 
graph is the nearest-neighbor graph. We provide an expression 
for the log-likelihood ratio in section lTV-CI We define the error 
exponent under the Neyman-Pearson formulation in section [Vl 
and specify the random placement of nodes in section [VAl In 
section [VI] we evaluate the error exponent, expressing it as a 
graph functional in section [VI- AI applying the LLN for graphs 
in section IVI-B1 and providing an explicit form for NNG in 
section IVI-CI We provide numerical results for the exponent 
in section IVI-DI and section IVIII concludes the paper. 

'We consider the terms node, vertex and sensor interchangeable. 
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4 2 
(a) A labeled simple undirected 
graph. 



(b) X : Non-zero elements of 
potential matrix. 



Fig. 2. Dependency graph and potential matrix of a GMRF. 



II. Gauss-Markov Random Field 

A GMRF, in addition to being a Gaussian random field, 
satisfies special conditional independence properties. A simple 
example is the first-order auto-regressive process, where the 
conditional independence of the observations is based on 
causality. However, a spatial random field has a far richer 
set of conditional independencies, requiring a more general 
definition E P. 21]. 

Definition 1 (GMRF): Given a point set V = {1, ...,n}, 
Yy = {Yi : i e V} is a GMRF with an (undirected) 
dependency graph £ (V, £) if Yy is a Gaussian random field, 
and Vi, j € V, Yi and Yj are conditionally independent given 
observations at all other nodes if i and j are not neighbors, 
i.e., 



Y{ -L ^u"^"— ij 



(4) 



where _L denotes conditional independence and Y_ij: 
keV,k^i,j). 

A common approach to formulating a GMRF is to specify 
the dependency graph through a neighborhood rule and then 
to specify the correlation function between these neighbors. 
Thus, in a GMRF, local characteristics completely determine 
the joint distribution of the Gaussian field. 

The following Markov properties are equivalent in a GMRF: 

1) pairwise-Markov property 



Y$ -L Yj ij 



2) local-Markov property 



Yi -L Y -(i,jV(i))|YA/-(i), 
3) global-Markov property 

Y A ± Y B \Y C , 



(5) 



(6) 



(7) 



for disjoint sets A, B, and C, with A and B non-empty, 
where the set C separates A and B i.e., on removing the 
nodes in C from the graph, nodes in A are no longer 
connected to the nodes in B. 
Thus, in ©, the local-Markov property states that the condi- 
tional distribution at a node in the DG given the observations at 
its neighbors is independent of the rest of the network. By the 
global-Markov property in (|7), all the connected components 



of a dependency graph are independent. As an illustration, in 
Fig |2] we have Y" 6 _L Yj given the rest of network, Yi _L Y2IY3, 
and so on. 

III. Acyclic Dependency Graph 

A special case of the dependency graph is an acyclic 
or a cycle-free graph. Here, the neighbors of a node are 
not themselves neighbors. The joint distribution is somewhat 
easier to evaluate in this case. We note that an acyclic graph 
with at least one edge, always has a leaf i.e., it has a node 
with degree 1 and has utmost n — 1 edges in a n-node graph. 

The covariance matrix £ of a GMRF satisfies some special 
properties. For instance, consider the cross covariance between 
the neighbors of a node, i.e., nodes that are two hops awa 
in an acyclic DG. By the global-Markov property we havi 
assuming £(i,i) > 0, for i € V, Deg(i) > 2, j, k eJ\f(i),j ^ 
k, 



£0\*) = 



T,(i, j)£(i,fc) 
£(i,i) 



For example, in Fig|2] 



£(1,2) 



S(1,3)S(2,3) 
£(3,3) 



(8) 



(9) 



We can similarly find an expression for the covariance between 
any two nodes of the GMRF. Thus, the covariance matrix of 
a GMRF with acyclic dependency can be expressed solely 
in terms of the auto covariance of the nodes and the cross 
covariance between the neighbors of the dependency graph. 

A. Potential Matrix 

The inverse of the covariance matrix of a non-degenerate 
GMRF (i.e., with a positive-definite covariance matrix) is 
known as the potential matrix or the precision matrix or the 
information matrix. The non-zero elements of the potential 
matrix A are in one to one correspondence with the edges of 
its graph Q(V,£) (H Theorem 2.2] in the sense that 



l no j 



A(i,j) = 0, Vi,jeV,i^j, 



(10) 



and is illustrated in Figj2] 

This simple correspondence between the conditional inde- 
pendence of the GMRF and the zero structure of its potential 
matrix is not evident in the covariance matrix, which is 
generally a completely dense matrix. Therefore, it is easier 
to evaluate the joint distribution of the GMRF through the po- 
tential matrix. In practice, however, estimates of the covariance 
matrix are easier to obtain through the empirical observations. 
Therefore, it is desirable to have the joint distribution in terms 
of coefficients of the covariance matrix. Thus, an explicit 
expression between the coefficients of the covariance and the 
potential matrix is needed. We provide such an expression 
and also obtain the determinant of the potential matrix in the 
theorem below. 

2 For X, Y jointly zero mean Gaussian, E(X|y) = S^E^y. 
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Theorem 1 (Elements & Determinant of Potential Matrix): 
The elements of the potential matrix A:=S~ , for a positive- 
definite covariance matrix S and acyclic dependency graph 

G(V,£), are 



We denote the ratio between the variances under the alternative 
and the null hypothesis at each node by 



K; 



(17) 



A{i,i) 



£(M)V~ ' S(i,i)S0',i) - S(i,j) 2 /' 

-u/../) : < s^AAji-sfyj 1 lfz ~ J ' (id 



o 



The determinant of the potential matrix of A is given by 

iai = -L = „iv ev ,„r,' { d2) 



isi np(M)s(wi-E(M?r 

Proof: The proof is based on acyclicity of dependency 
graph. See Appendix lAl □ 

IV. Hypothesis-Testing Problem 

Let V = {1, . . . , n} be a set of n nodes on the plane and let 
Y„ be the random vector of observation samples Yi, i € V, 



Y„:=[Y 1 ,...,K„] T . (13) 
The hypothesis-testing problem is as follows (also see FigJTJl, 



Ho : Y„~AA(0,a 2 I) vs. Hi : Y„ ~ tf(0, S 1>v ), (14) 

where S^y is a positive-definite covariance matrix under the 
alternative hypothesis and is dependent on the configuration 
of nodes in V and CTq > is the uniform variance under the 
null hypothesis. 

The optimal decision-rule under both NP and Bayesian 
formulations is a threshold test based on the log-likelihood 
ratio (LLR). Let p[Y n \V]Hj] be the conditional PDF of the 
observations given the point set V under hypothesis j. The 
LLR given by 



ttd/v in i P[Y„,V;-H ] , p[Y n ;H } 
LLR(Y„,V) := ^ ^^ =^ ^v^] ' (15) 

= ^(log^+Y r T[S^-K 2 I)- 1 ]Y n ), 

where in ( [15] ), we have used the fact that the sensor observa- 
tions are independent of V under Ho- 

A. Covariance Matrix of GMRF 

We make additional assumption on the structure of the 
covariance matrix Si y of the GMRF under Hi viz., that 
the nodes have the same measurement variance for any node 
configuration V, i.e., 



We also assume that under Hi, the amount of correlation 
between the neighbors i, j of the dependency graph is specified 
by an arbitrary function g, which has the Euclidean edge length 
Rij as its argument. From ( fT6l ), we have 



< i, V (»,j) e£. 



(18) 



The correlation function g is required to satisfy some regularity 
conditions, which will be stated in Lemma [2] In general, g is 
a monotonically non-increasing function of the edge length, 
since amount of correlation usually decays as nodes become 
farther apart. Moreover, g(0) — M < 1, or the so-called 
nugget effect, according to geo-statistics literature (35), ll36ll . 
It has been observed in mining applications, where the micro- 
scale variation is assumed to be caused by the existence of 
small nuggets of the enriched ore. Many other ecological 
phenomena such as soil bacteria population [37], aquatic 
population 11381 etc. also exhibit this behavior. Note that the 
presence of nugget effect has the same effect on correlation as 
imposing an exclusion region on how near two nodes can be 
placed. However, for such an exclusion constraint to hold, we 
need more complicated node placement distributions than the 
uniform or Poisson assumption. Although such distributions 
can be handled in principle, they are not analytically tractable. 
Some examples of the correlation function are 

g(R) = Me~ aR , g(R) = a > 0,0 < M < 1. 

Note that these conditions together with an acyclic dependency 
graph Q assure that the covariance matrix Si y is positive 
definite. This is because V i,j G V, 



Hiy(i,i)Exy(j, j)-Y;l v (i, j) = af[l 



i - n 9 2 (Rki)\>o, 

O,0epath(ij) 

where path(i,j) is the unique edge-path between i and j in 
graph Q if it exists. From Theorem [Tj we have |S| > 0. 

B. Nearest-Neighbor Graph 

We assume the dependency graph to be the nearest-neighbor 
graph. The nearest-neighbor function of a node i 6 V, is 
defined as, 



nn(i): 



: arg min dist(i,j), 



(19) 



where dist(v) is the Euclidean distance. The inter-point 
distances are unique with probability 1, for uniform and 
Poisson point sets under consideration here. Therefore, nn(i) 
is a well-defined function almost surely. The nearest-neighbor 
(undirected) graph Q(V,£) is given by 



Ei v(i 5 i)'-— a\ > 0, i = 1, 



(16) 



(*,j)e£ 



nn(j) or j =nn(i). 



(20) 
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of directed NNG 



Directed NNG 
Undirected NNG 




Fig. 3. Directed & undirected versions of nearest-neighbor graph. 
[ No. of undirected edges = No. of directed edges - 0.5 x No. of 
biroots.] 



Fig. 4. Illustration of point process U n \ or V n \ : n nodes distributed i.i.d. 
uniform or Poisson in regular Borel regions (such as squares or circles) of 
area y with constant density A. For error exponent, we consider n — ¥ oo. 



NNG has a number of important properties. It is acyclic with 
a maximum^ node degree of 6 . 

In section IVI-C1 it turns out that we need to analyze the 
directed NNG, in order to obtain the final form of the error 
exponent. We now mention some of its special properties. The 
directed NNG G'(V, £') is defined by 



£' = {< i,nn(i) >,i e V}, 



(21) 



For a directed NNG with at least two nodes, each connected 
component contains exactly one 2-cycle. This is known as 
the biroot of the component [40]. See FigO Also note, the 
directed NNG counts the edges from these biroots twice, while 
the undirected version counts only once. 

C. Expression for Log-Likelihood Ratio 

Since the NNG is acyclic, equations ( fTTIfT2l are valid. We 
incorporate additional assumptions ( fT6lfT8b in the theorem to 
obtain the LLR for detection. 

Theorem 2 (Log-Likelihood Ratio): Under the assumptions 
( 1161181 . the log-likelihood ratio in ( fT5l ) for the hypothesis- 
testing problem in ( fT4l >. given an arbitrary point set V = 
{1, ...,n},is 



LLR(Y„,V) 



nlog 



+ J2 {iog[i-P 2 (%)] 



Y- 

2 I t 



(i,j)e£ 

i<j 



9 2 {Rq) Y ? + Y* 
l-g 2 (Rij) u\ 
2g{R ij ) YjYj 



tiRX 

of / 



(22) 



where Rij is the Euclidean edge length of € £, that 

depends on the configuration ofV. The condition i < j ensures 
that every edge is counted only once. 

Theorem [2] gives a closed-form expression for the log- 
likelihood ratio, in terms of the edges of the nearest-neighbor 
dependency graph of the GMRF Note in (l22l . the cross-terms 
are only between the neighbors of the dependency graph, 
which can be exploited to yield explicit data-fusion and routing 
schemes BP . 

3 The node degree is finite for NNG in any dimension and is called the 
kissing number 1391. 



V. Neyman-Pearson Error Exponent 

The spectrum of the log-likelihood ratio is defined as the 
distribution of the normalized LLR evaluated under the null 
hypothesis. In |25l Theorem 1], it is proven that for Neyman- 
Pearson detection under a fixed type-I error bound 0, the LLR 
spectrum can fully characterize the type-II error exponent of 
the hypothesis-testing system and is independent of the type-I 
bound. 

A special case of this result is when the LLR spectrum 
converges almost surely (a.s) to a constant D 



1 



LLR(Y„,V) 



1 p[Y n ;U ] a.s. 



To 



p[Y n \V;Hi 



D, undergo. 



In this case, the NP type-II error exponent is given by the 
above constant D. In other words, the error exponent D of 
NP detection in ((TJ is 



D:= lim I log f^"] 



under Hr 



(23) 



where lim denotes the almost-sure limit, assuming it exists. 
Note that when Y n are i.i.d. conditioned under either Hq 
or Hi, the result reduces to the Stein's lemma [43 Theorem 
12.8.1] and the limit in ( l23l to the Kullback-Leibler distance. 

A. Random Point Sets 

It is intractable to evaluate the error exponent D in d23l for 
an arbitrary point set. Therefore, we assume that the nodes 
are placed randomly, according to a point process defined on 
expanding regions. We consider two related point processes : 
the Poisson process and the binomial point process on a large 
region, which we define below. 

Definition 2 (Poisson and Binomial Processes H44\l ): 
Let (B n ) n >i denote a sequence of squares or circled of 
area centered at the origin, for any A > 0. A binomial 
point process on B n , denoted by U n .\, consists of n points 
distributed i.i.d. uniformly on B n . A homogeneous Poisson 
process of intensity A on B n , denoted by V n ,\, satisfies the 
following properties: 

1) for any set A C B n with area A, the number of points 
in A is Poisson distributed with mean \A, 

The generalization to an exponential type-I error bound 1 25], 1 42 1 is not 
tractable since a closed-form cumulative distribution of the LLR is needed. 
5 The results hold for regular Borel sets under some conditions [4 P. 1007]. 
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Normalized sum of edge weights 



Expectation of edges 
of origin of Poisson process 

iE Y. <««o,x) 

(x,o)es(xuo) 



Fig. 5. Pictorial representation of LLN for graph functional of uniform or 
Poisson point sets. 



2) for any n £ N and A C B n with area A > 0, conditioned 
on n number of points in A, the point process on A is 
a binomial process. 

We are interested in evaluating the error exponent under 
both the binomial or Poisson point processes, when the mean 
number of nodes goes to infinity, with fixed node density, i.e., 
n — » oo with A fixed. 



VI. Closed-Form Error Exponent 

A. Error Exponent as a Graph Functional 

In order to derive the error exponent, we cast the error 
exponent as the limit of sum of node and edge functionals 
of the dependency graph of a marked point set in the lemma 
below. This formulation is required in order to apply the law 
of large numbers for graph functionals. 

Lemma 1 (D as a Graph Functional): Given the marked 
point set V drawn from the binomial process U, h \ or the 

Poisson process V n ,\, with marking variable Yi 7V(0, cr 2 ), 
the error exponent D in (1231 is given by the limit of sum of 
edge and node functionals of the nearest-neighbor graph as 



D = 



log — + lim — 

(Jq n-s-oo Zn 



iev 



]T {log[l-g 2 (R ij )] + - 

j 

2g(R l0 ) YiYj 



Kj 



-9 2 (Rv) °\ 



l- 5 2 (i? 4 ,) a\ 



-} 



, y 4 l ~ AA(0,a 2 ), (24) 



where Rij is the (random) Euclidean edge length of(i,j) G £, 
that depends on the underlying point process. The condition 
i < j ensures that every edge is counted only once. 
Proof : Substitute ([22) in d23l . □ 
In the lemma above, the point set forming the graph is drawn 
from a marked binomial or Poisson point process, with the 
marking variable Yi l Af(0, <7q). This is because evaluating 
the error exponent (1231 under Ho implies that the sensor 
observations Y^ are i.i.d. and independent of the locations of 
the nodes, and therefore can be viewed as a marking process. 



B. Law of Large Numbers for Graph Functionals 

The law of large numbers for functionals on graphs enables 
us to evaluate the limij^l in d24l . This law applies to graphs 
which are random in the sense that the vertex set is a marked 
random point set. LLN on graphs is based on the so-called 
objective method. Steele [45 1 coined this term for a philosophy 
whereby, loosely speaking, one describes the limiting behavior 
of functionals on finite point sets of binomial process in terms 
of related functionals defined on infinite Poisson point sets. 
Also see Fig|5] Penrose and Yukich 04), ll33l . Il34l introduce a 
concept of stabilizing functionals and use the objective method 
to establish a strong law of large numbers for graph functionals 
ll33l P. 287]. In order to apply this law, some conditions need 
to be satisfied in terms of bounded moments. In the lemma 
below, we place these conditions on the correlation function. 

Lemma 2 ( Conditions for LLN): The graph functional in 
(l24l satisfies the conditions for law of large numbers for 
graph functionals derived in [33 P. 287], when the correlation 
function g is monotonically non-increasing with the edge- 
lengths, g(oo) = 0, and g(0) — M < 1. Hence, the graph 
functional in d24l) converges almost surely to a constant. 
Proof : See appendix 151 

Theorem 3 (LLN): Under the conditions stated in Lemmaf2] 
for nodes placed according to li n< \ or V n ,\, with node density 
A and region area j, from the law of large numbers for graph 
functionals, the expression for the error exponent D in (l24l 
for Neyman-Pearson detection of the GMRF defined by the 
NNG is given by 



2 



where 



2 



2^ /(ff(*o,x)) 

XiXeTV 
(0,X)GNNG(XUO) 



f(x):=\og{l-x 2 } + 



■ \ogK + — -l 

A 



2x 2 



(25) 



K[l 



(26) 



K is the ratio of variances defined in ( fl7| i, and i?o.x are 
the (random) lengths of edge (X, 0) incident on the origin 
in a NNG, when the nodes are distributed according to 
homogeneous Poisson process V\, of intensity A. 
Proof : Apply LLN to (f24]l. See appendix |C] □ 
In the theorem above, the law of large numbers yields 
the same under the Poisson or the binomial process. 

Thus, we provide a single expression for the error exponent 
under both the processes. Also, the above theorem provides 
the error exponent in terms of the expectation of a graph 
functional around the origin, with the points drawn from an 
infinite Poisson process. Thus, the functional is reduced to 
a localized effect around the origin. This is an instance of 
the broad concept of stabilization which states that the local 
behavior of the graph in a bounded region is unaffected by 

6 Nature of convergence is convergence of means and complete convergence 
(c.m.c.c) and implies almost-sure convergence. 

7 In general, the limit is not the same for Poisson and binomial processes. 
For a different problem, we show that the error exponents are affected by a 
random sample size |46|. 
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points beyond a finite (but random) distance from that region. 
NNG is one such stabilizing graph with translation and scale- 
invariance [4, Lemma 6.1]. 



and region area j, the error exponent D for Ney man-Pearson 
detection is 



C. Explicit Form for Nearest-Neighbor Graph 

The evaluation of the expectation of the edge functional in 
( |25] l is complicated and needs further simplification. In order 
to obtain the final form of the exponent, we exploit some 
special properties of the NNG. It turns out that the expectation 
term is easier to evaluate for the directed nearest-neighbor 
graph rather than the undirected version. We therefore split the 
sum of edge functionals in d24b . using the fact that the directed 
NNG counts the weights from biroots or mutual neighbors 
twice, while the undirected version counts only once. See 
Figf3] We therefore split the sum of the edge functionals of 
the undirected NNG as 



eGNNG(V) 



E f(s(Re)) 
eSDNNG(V) 

~\ E f(9(Re)), (27) 

eGMNNG(V) 



where NNG(V), DNNG and MNNG C DNNG are the undi- 
rected NNG, the directed NNG, and edges between the biroots 
or the mutual neighbors of the directed NNG, respectively. 
Now, we evaluate the expectation for the two terms separately, 
since expectation is linear. A similar approach is employed in 

eo. 

We now provide an expression for the limit of the edge 
functional based on the distribution of distances of the directed 
NNG, which are related to hitting or vacancy probabilities of 
the spatial point process, which are typically exponential or 
gamma distributed, similar to their one-dimensional counter- 
parts EH). 

Lemma 3 (Expectation of Edge Functional): The expecta- 
tion term of the edge functional in d25l > is given by 

±E E f(9(Ro^))=Ef(9(Zi))- ^-Ef(g(Z 2 )), (28) 



(0,X)GNNG(XUO) 



where Z\ and Z 2 are Rayleigh distributed with variances 
(27rA) -1 and (2o;A) _1 , and to is given by 



- = Y + — «5.06, 



(29) 



and is the area of the union of two unit- radii circles with 
centers unit distant apart. 

Proof : See appendix iDl □ 
In the theorem below, we combine Lemmas |2] [3] and 
Theorem [3] to obtain the final form of the error exponent. 

Theorem 4 (Expression for D): Under the assumptions 
( 1161118b and conditions stated in Lemma |2 for a GMRF with 
NNG dependency and correlation function g and nodes drawn 
from the binomial or the Poisson process with node density A 



D g (K,M,X) = ^[Ef(g(Z 1 ),K)-^ ; Ef(g(Z 2 ),K) 



logiiT+i-l], 



(30) 



where 



/(i,A-):=log[l 



2a; 2 



K\l 



(31) 



Zx and Z 2 are Rayleigh distributed with second moments 
(27TA)- 1 and (2cjA) — 1 . 

The above theorem holds for any general correlation func- 
tion. In d30l l. except for the first two /-terms which capture 
the correlation structure of the GMRF, the remaining terms 
represent the detection error exponent for two IID Gaussian 
processes. In the corollary below, we specialize ( f30b to the case 
of constant correlation. In this case, the two /-terms reduce 
to a single term. 

Corollary 1 (Constant Correlation): For constant values of 
the correlation, the error exponent D is independent of the 
node density A and 

1) for constant positive correlation or g(R e ) = M < 
1, Ve G £ , we have 



D(K, M) 



1 r 
2 



log if 



(1 



2u 



K 

)f(M,K) 



(32) 



where / and uj are given by $2M and 
2) for the independent case or g{R e ) = 0, Ve G £, we have 



D(K,0) = ~[logK+±-l 



(33) 



In the above corollary, we verify that ( l32l reduces to d33l ). 
on substituting M = 0. In ( T3"2j >, the effect of correlation can 
be easily analyzed through the sign of the function f(M, K). 
Also, 



f(M,K) 



< 0, 

> o, 



for K > 
for K < 2. 



(34a) 
(34b) 



Therefore, at large variance-ratios, the presence of correlation 
hurts the asymptotic performance, when compared with the 
independent case. But the situation is reversed at low values of 
the variance ratio and the presence of correlation helps in de- 
tection performance. In the next section, we will draw similar 
conclusions when the correlation function is the exponential 
function through numerical evaluations. 




K in dB K in dB 

(a) Different values of correlation coefficient a, nugget M = 0.5. (b) Different values of nugget M, correlation coefficient a = 0.5. 

Fig. 6. Error exponent D vs. ratio of variances K, node density A = 1. See 132135) . 



D. Numerical Results 

In this section, we focus on a specific correlation function 
namely the exponential-correlation function, 

g{R) = Me~ aR , a > 0, < M < 1. (35) 

Using Theorem |4j we numerically evaluate D through Monte- 
Carlo runs. In fl30l ), the error exponent is an implicit function of 
the correlation coefficient a, through the correlation function 
g. For fixed values of K and M, we have 

D(K, M, A, a) — D(K, M, 1, -?=), (36) 

v A 

which we obtain by changing the integration variable in the 
expectation term in (f30b - Therefore, in terms of the error 
exponent, increasing the node density A is equivalent to a 
lower correlation coefficient at unit density. Here, we plot only 
the effects of correlation coefficient a and nugget M on D. 

In Fig|6ja), we plot the error exponent at A = 1 and 
M = 0.5, for different values of correlation coefficient a. 
Note, the cases a — and a — ► oo correspond to ( T32l i and 
P3t . We notice that a more correlated GMRF or the one with 
smaller a, has a higher exponent at low value of K, whereas 
the situation is reversed at high K. Equivalently, increasing 
the node density A improves the exponent at low value of K, 
but not at high K. Also, when the variance ratio K is large 
enough, D appears to increase linearly with K (in dB), and 
the correlation coefficient a and nugget M appear to have 
little effect, as expected from Theorem @] In Fig|(Jb), we plot 
the exponent at constant correlation coefficient a = 0.5 for 
different values of the nugget M. Also note, M = reduces 
to the independent case. We notice a similar behavior as the 
correlation coefficient. A higher value of M results in a higher 
exponent at low K, but not at high K. 

VII. Conclusion 

In general, finding the closed form detection error exponent 
is not tractable. The graphical structure of the Markov random 
field allows us to exploit existing results in spatial probability 
literature. We employed the law of large numbers for graph 



functionals to derive the detection error exponent for a Gauss- 
Markov random field with nearest-neighbor dependency graph. 
We then investigated the influence of model parameters such 
as the variance ratio and the correlation function on the error 
exponent. 

In this paper, we have assumed identical variance at every 
sensor. However, a spatially varying SNR model can be 
incorporated into our results. We have focused on the GMRF 
defined by the acyclic dependency graph and derived the 
exponent for the nearest-neighbor graph. This is a simplifying 
assumption. Although, the law of large numbers is valid for 
a number of proximity graphs, which have edges between 
"nearby" points, the actual evaluation of the log-likelihood 
ratio and the exponent are intractable for most of these graphs. 
We have not considered correlation under null hypothesis for 
which one requires a LLN with correlated marks. We have 
also not considered the case when the signal field is not 
directly observable, resulting in a hidden GMRF. The sparse 
structure of the potential matrix is no longer valid under such 
a scenario. However, note, GMRF with small neighborhood 
has been demonstrated to approximate the hidden GMRF [49 1 
as well as Gaussian field with long correlation lengths f50l . 
reasonably well. 

The error exponent can be employed as a performance 
measure for network design. In BTI . we utilize the closed 
form derived in this paper to obtain an optimal node density 
that maximizes the exponent subject to a routing energy con- 
straint. We have also proposed minimum energy data fusion 
and routing schemes that exploit the correlation structure 
of Markov random field in a related publication |41|. We 
further investigate tradeoffs between the routing energy and 
the resulting error exponent in ll52l . 
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Appendix 

A. Proof of Theorem [7J 
Using the expression AS 



I, we have the following 



identities: 



A(i,i) 
A(i,i) + A(i,j) 



E A ^i) 



E A ^ k ) 



(37) 



S(i,j) 
0, Vj e JV(i), (38) 



where ( f37T > is obtained by the sum-product of z"' row and 
i' h column of A and X. Similarly, d38l is obtained by sum- 
product of I th row of A and f h column of £ and dividing by 
In d38l . by acyclicity for k £ Af{i) and k ^ j, we 
have j rp k. From (O, we have 

S(j,fe) £(i,fc) , 

Subtracting (38) from (37), only the terms with A(i,j) survive 
and hence, we obtain A(i,j). Substituting all the A(i,j)'$ in 
( |37l i, we obtain A(i,i). Hence, all the coefficients of potential 
matrix A are given by (fTTT i. 

Let |A( n )| be the determinant of the potential matrix of n 
nodes. Assume n > 1, since we have |AW| = S(l, l)" 1 . 
The determinant of the potential matrix is the product of 
determinants of the connected components. We therefore con- 
sider only one component Q'(V',£') C Q. Assume Q' has 
at least one edge, otherwise we have for diagonal matrix 
l A(n) l = LTiGV Since Q' is acyclic, it has a leaf, 

i.e., there is some vertex a with degree 1. Let b be its 
only neighbor. We assume the vertices have been ordered 
V = {Vx,...,V n } so that V n -i = b,V n = a. Then A<") 
has the following form 



AW = 







••• • A(n-l,n-l) A(n-l,n) 
••• A(n,n-T) A(n,n) 

where we have from (fTTT i. 



A(n, n) 
A(n - 1, n) 
A(n - 1, n - 1) 



S(ri -1,71-1) 



[S(n, ra)E(n - 1, n - 1) - S(n, n - l) 2 ] ' 

-S(n, n - 1) 
[S(n, n)S(n - 1, n - 1) - S(n, n - l) 2 ] ' 

1 . , S(n, n-1) 



S(ti - l,n- 1) 



- A(n - 1, 



S(ti - 1, n- 1) 



where C represents contributions from nodes in V'\V n i.e., 
with node V n removed, and having an edge with T4_i- 
Multiplying the n lh column by 

A(n, n— 1) — S(n, n— 1) 
A(n,n) S(n — l,n— 1) 

and subtracting it from (n — l) lh column and using the 
determinant rule, we have 



|A (n) | = 





• A'(n — l,n — l) A(n— l,n) 
A(n,n) 



(39) 



where 



A'(n- :=A(n - l,n- 1) 

S(n, n — 1) 



S(n — 1, n — 1) 



A(n,n~ 1).(40) 



Hence, we have 



|A (n) | = A(n,n)\M n \, for n > 1, 

where M„ is the minor of A(n, n) in (|39l . Substituting in 
(|40| i, we have A'(n— l,n— 1) = C, where as noted before, 
C is the contributions from nodes in V'\V n and having an 
edge with V n -\- This implies that A'(n— l,n — 1) is the 
coefficient in the potential matrix for the subgraph induced by 
V'\V n . Since only V n -i has an edge with V n , coefficients of 
nodes other than V n and V n -i are unaffected by the removal 
of V n - Hence, M„ is the potential matrix for the subgraph 
induced by V'\V n , 

M n =A {n - 1) . 

Since V'\V n is acyclic, a leaf is always present, rearrange the 
rows such that A'™ -1 ) has a leaf in the last two rows, i.e., 
it has the same structure as in (l39l . Remove a leaf in each 
step of the recursion, until all the edges are removed, then 
find the determinant with the diagonal matrix consisting of 
the remaining nodes and we obtain dl2l . □ 

B. Proof of Lemma [2] 

We can regard K;'s as marking, since under Ho they are i.i.d. 
independent of spatial point process. The strong stabilization 
condition is satisfied for NNG P. 1023, Lemma 6.1]. We 
therefore only need to prove the uniform bounded moment 
condition. We express the edge functional as the sum of two 
functionals, for i ~ j, given by 



h(R tJ ) ■■= -log[l- 5 2 (%)], 



g\R tJ )[Y? + Y^-2 9 {R tJ )Y l Y 3 



(41) 
(42) 



Given a finite marked set V, the sum functional is denoted by 
H i.e., 



H k (V):= E 



k = 1,2. 



(43) 



(ij)GNNG(V) 

i,jev 



Given Hk, we denote the add one cost 11331 (3.1)], which is 
the increment in H, caused by inserting a marked point at the 
origin into a finite marked set V, by 
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A k (V):=H k (VU{0})-H k (V). 



(44) 



Hi satisfies the polynomial-bounded condition [33, (3.3)], 
since fa in (l4TT i is a finite function, and the number of edges 
in NNG is at most n— 1, for n points. However, the functional 
H 2 does not satisfy the polynomial-bounded condition since 
the measurements Y; in d42b are unbounded. Instead, we define 
truncated random variable Z as 



Z:= 



sgn(Y)Clogn, 



if |V| < Clogn, (45a) 
o.w, (45b) 



where sgn is the sign function and C > is a constant. 
Consider the functionals H' 2 , (j>' 2 by replacing Y n with Z„ in 
H 2 and fa respectively. Now, H' 2 is polynomially bounded. 
Further, we have lirrin^oo Z ^4' Y and hence, lirxin^oo (H 2 — 

h 2 ) a A- 0. 

Definition 3 (Uniform Bounded Moments for fa): Define 
Mm, A to be m uniform random variables on A E B and i?o.x 
to be the (random) lengths of the edge (0, X) in graph Q 
incident on the origin. Then, the bounded p-moment condition 
(3.7)] 



sup 

AeBfleA, 



sup 

r A I A 3AIA 



E 



: E • 

(o,x)eS 
xew m ,A 



; (^o,x)F 



< oo, k — 1,2, 



(46) 



is true for some p > 1. 

Without the above condition, nothing can be said about the 
almost sure convergence, although, by Fatou's lemma, the limit 
of the LLN would be a bound on D. 

Since fa and fa are decreasing functions edge length, with 
maximum at zero, we have 

E[ Y, MRo^W < C p E[fa(0)} p , fc = 1, 2, Vp > 0, 

(O.X)GNNG(X) 
XGW m ,A 

where C is the kissing number, a constant, and Deg(O) < C 
for the NNG. Now, fa{Q) = - log[l-M 2 ] < oo, since g(0) = 
M < 1, and 

MP 



E[fa(oy] < 



(1 - NP)p 



E[Yi - YjfP < oo, 



since Yi , Yj 



' Af(0, Uq). Hence, the uniform-bounded 
moment for fa in (|46T > holds. 

Now, we show the uniform-bounded moment for H 11331 
(3.2)], obtained by replacing fa in ( l46b by Afe. The positive 
part of Afc is bounded by Deg(O)0fc(O), whose expectation is 
shown to be finite. For the negative part A k (l( m ,A)~> along 
the lines of |]4] Lemma 6.2], let l{nn(i) = 0} be the event 
that the origin is the nearest neighbor of i e U m .A- Then, 
the number of deleted edges on adding the origin is given by 
££Lil{m(i) = 0} < C, we have A k (U m , A )- < Cfa(0), 
whose expectation is shown to be finite. Hence, the bounded- 
moment condition for H holds and LLN is applicable. □ 




Fig. 7. Illustration of the event that the origin is a biroot in the directed 
NNG. This implie s th at there is no other point in the union of the circles 
shown above. See i53\ . 



C. Proof of Theorem \3\ 

We have the distribution of Y„ under the null hypothesis 



p[Y n \V;H Q } = 



\ 2al ) 



Therefore, the limit of the determinant is given by 
log|E ,v| , 

llm = log (To- 

ri— >oo in 



(47) 



We have E,"=i ¥ ^ W?M = a 2 a.s. under H . 
Therefore, the term in (l24l 



n 



1 \Y? 



- 1. 



By Lemma 12 the conditions for LLN hold and therefore as 

n — > oo, 



eSNNG(V) 



-E 



i E 

n t-^ 

(tj)eNNG(V) 
i<3 



- log[l-5 2 (i?e)] 

log[l-5 a (i2o,x)], (48) 

X:XGPa 

;o,x)eNNG(x) 

\R t3 )[Y^+Y^]-2g{R ll )Y^ 

(0,X)SNNG(X) 

where, in d49b we first take the expectation over IV s and 
use the fact that E[ y °+ 3 y ^ ] = 2(f-) 2 and E[Y" y x ] = 0. 
Collecting all the terms we have (ff5^. □ 

D. Proof of Lemma \3\ 

We use an approach similar to ll47ll . Let B Z (X.) denote a 
circle of radius z, centered at X. We take expectation on both 
sides of (|27| > for graphs over all the Poisson points XU 0. Let 
NNG(V), DNNG and MNNG C DNNG be the undirected 
nearest-neighbot graph, the directed nearest-neighbor graph, 
and edges between the biroots or the mutual neighbors of the 
directed nearest-neighbor graph. See Figf3] 
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n E /(s(flo,x))]=E[ E f(a(Ro, x ))] 

X:X6Pi X:X6Pj, 
(0,X)sNNG(X) (0,X)eDNNG(X) 

-\n E /(ff(^,x))]. (so) 

X:X6P A 

(0,X)sMNNG(X) 

The first term on the right-hand side in ( f50b simplifies as 

E[ E f(9(Ro,x.))]=nf(9(Zx))}, (51) 

X:X£-P A , 
(0,X)GDNNG(X) 

where Zi is the unique directed nearest-neighbor distance of 
the origin with points distributed according to V\, the Poisson 
point process of intensity A on 5ft 2 . The random variable Z\ 
is like a waiting time, and can be visualized as the time taken 
for an inflating circle to first touch a point from the Poisson 
process. We therefore have Z x > z iff. B z (0) does not contain 
any points from the Poisson process, i.e., 

P[Zi >z]= P[$X ^ £ B z (0) n "Pa] = e" X7VZ ' 2 . (52) 

Therefore, Zi is Rayleigh with second moment (27rA) _1 . 

Similarly, for the second term, we need to find the PDF 
of the nearest-neighbor distance of the origin when the origin 
is a biroot or a mutual nearest neighbor. This event occurs 
when the union of the circles centered at origin and its nearest 
neighbor contains no other Poisson point. See Figj7] Let A be 
the intersection of the events that the directed nearest-neighbor 
distance of origin lies in the interval [z,z + dz] and the event 
that origin is a biroot 

A := (P A n(B 2 (O)UB 2 (nn(O)))\{O,nn(O)} = 0) 

n(Z x € [z,z + dz]). (53) 

Its probability is given by, 



From ([271 [52] [56]), we obtain d2S}. □ 
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F[A] = P(origin is biroot|Zi)P(Zi € [z, z + dz}) 

= e-^- n ^ z2 2\irze- Xnz2 dz (54) 

= 2\Trze-" Xz2 dz = -[2ujttz&- uXz2 dz] (55) 

= -P(Z 2 € [z,z + dz]), (56) 
to 

where nn(0) is the nearest-neighbor of the origin and 
u:=\Bi(0) U Bi(l)\ = $f + the area of the union of 
circles unit distant apart and Z2 is a Rayleigh variable with 
variance (27ru;) _1 . Hence, the second term on the right-hand 
side in d50b simplifies as 

~E[ E /Grt*o,x))] = ^m[f(g(Z 2 ))}. (57) 
X:Xep A 

(0,X)eMNNG(X) 
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